Abstract


Porcine epidemic diarrhea virus (PEDV) belongs to the genus Alphacoronavirus of the family Coronaviridae.
PEDV was identified as an emerging pathogen in US pig populations in 2013. Since then, the virus has
been detected in at least 31 US states and has caused significant economic loss to the swine industry.
Active surveillance and characterization of PEDV are essential for monitoring the virus. Comprehensive
information about the PEDV genome can improve our understanding of the evolution of PEDV and the
emergence of new strains, and can enhance vaccine design. In this chapter, both a targeted
amplification method and a random-priming method are described to amplify the complete genome of
PEDV for sequencing on the MiSeq platform. Overall, this protocol provides a useful two-pronged
approach to completing whole-genome sequences of PEDV, depending on the amount of virus in the
clinical samples.


Key words Porcine epidemic diarrhea virus, Targeted amplification, SISPA, Whole-genome sequencing,
MiSeq


1 Introduction


Complete genome sequencing and genetic analysis have significantly
improved our understanding of the evolution and relationships of
porcine epidemic diarrhea virus (PEDV) strains worldwide. The
first PEDV whole-genome sequence was completed for the prototype
strain CV777 in 2001 [1]. Since then, many PEDV strains
have been sequenced, and over 170 whole-genome sequences
have now been deposited in GenBank. Based on phylogenetic analysis
of whole-genome sequences, PEDV has been classified into
two genogroups, 1 and 2, to which the variant and classical strains
of US PEDV belong, respectively [2].

Since the first 454 FLX pyrosequencing platform was introduced
to the market in 2005, next-generation sequencing (NGS)
has significantly advanced research in diverse fields. NGS has the
advantages of high throughput and cost-effectiveness. Currently,
there are several platforms available, including the Genome
Analyzer developed by Illumina/Solexa and the Personal Genome
Machine (PGM) by Ion Torrent. One of the Illumina NGS platforms,
MiSeq, is commonly used in diagnostic laboratories.

In general, sequencing PEDV directly from fecal samples is
technically challenging without prior amplification with
specific primers. In this chapter, we describe a useful two-pronged
approach: the random-priming method can be used to
sequence the complete PEDV genome from samples with Ct values
of less than 15, whereas the targeted amplification method is
recommended for clinical fecal samples with
higher Ct values (low viral loads).


2 Materials


2.1 RNA Extraction from Feces or Intestinal Contents

1. MagMAX Pathogen RNA/DNA Kit (Life Technologies) for viral
   RNA extraction from fecal or intestinal contents (see Note 1).
2. HBSS (GIBCO).


2.2 Real-Time Reverse Transcriptase Polymerase Chain Reaction (RT-PCR) Reaction

1. One-Step RT-PCR kit (Qiagen).
2. Smart Cycler II (Cepheid, Sunnyvale, CA).
3. 10 pmol primers and probe [3] (see Note 2).
   Forward primer: 5′-CATGGGCTAGCTTTCAGGTC-3′.
   Reverse primer: 5′-CGGCCCATCACAGAAGTAGT-3′.
   Probe: 5′/56-FAM/CATTCTTGGTGGTCTTTCAATCCTGA/ZEN/3IABkFQ/3′.


2.3 Targeted Amplification One-Step RT-PCR

1. One-Step RT-PCR kit (Qiagen) (see Note 3).
2. Oligonucleotide primers dissolved in nuclease-free water to a
   stock concentration of 100 pmol/µl and a working concentration
   of 20 pmol/µl. The sequences for 19 pairs of primers are
   listed in Table 1.
3. Qiagen gel purification kit.
4. Qubit 2.0 Fluorometer (Life Technologies).


2.4 SISPA Method (Sequence-Independent, Single-Primer Amplification)

1. SuperScript II Reverse Transcriptase kit (Invitrogen).
2. RNase H (NEB).
3. Klenow fragment (NEB).
4. Advantage 2 PCR kit (Clontech).
5. 10 mM dNTP mix (NEB).
6. RNase Inhibitor (Promega).


Table 1
Nineteen pairs of primers for whole-genome amplification of PEDV

Fragment no.  Sequence                   Sense    Size (bp)
F1            ACTTAAAAAGATTTTCTATCTAC    Forward  1622
              CGTTAACGATACTAAGAGTGGC     Reverse
F2            TGGTGACCTTGCAAGTGCAGC      Forward  1603
              ATTACCAACAGCCTTATTAAGC     Reverse
F3            ACCATTGACCCAGTTTATAAGG     Forward  1587
              ACAAAAGCACTTACAGTGGC       Reverse
F4            TACACCTTTGATTAGTGTTGG      Forward  1614
              TTTGTAGCGTCTAACTCTAC       Reverse
F5            GTACCAGGTGATCTCAATGTG      Forward  1615
              ACGTGGCAATGTCATGGACG       Reverse
F6            ATGCTGCTGTTGCTGAGGCTC      Forward  1600
              TCAGTTGAGATAGAGTTGGC       Reverse
F7            GTGACAAGTTCGTAGGCTC        Forward  1597
              TAAGTGACAGAACTCACAGG       Reverse
F8            TGCACAAGGTCTTGTTAACATC     Forward  1601
              TCTGTGCACCATTAGGAGAATC     Reverse
F9            ACCTGCGTGTAGTCAAGTGG       Forward  1599
              GTTACCAGTGGAACACCATC       Reverse
F10           ACTGTGCCAACTTCAATACG       Forward  1611
              TCATCAACAAACACACCTGC       Reverse
F11           TGCTCGCAGCATACTATGCAG      Forward  1588
              GTGGTGCAGGCAGCTGTTGAG      Reverse
F12           TCTATGTGCACTAATTATGAC      Forward  1599
              TGATTGCACAATTCGGCCGC       Reverse
F13           CCATACATGATTGCTTTGTC       Forward  1595
              ATCGTCAAGCAGGAGATCC        Reverse
F14           TGTCTAGTAATGATAGCACG       Forward  1647
              TTATCCCATGTTATGCCGAC       Reverse
F15           TAATGATGTTACAACAGGTCG      Forward  1554
              AAGCCATAGATAGTATACTTG      Reverse
F16           TGAGTTGATTACTGGCACGCC      Forward  1598
              GTACTGTATGTAAAAACAGCAG     Reverse
F17           ATCGCAATCTCAGCGTTATG       Forward  1596
              GTGTAAACTGCGCTATTACAC      Reverse
F18           CTGCTTATTATAAGCATTAC       Forward  1603
              GCTTCTGCTGTTGCTTAAGC       Reverse
F19           AGTCTCGTAACCAGTCCAAG       Forward  1065
              TTTTTTTTTTTTTGTGTATCCAT    Reverse

7. Oligonucleotide primers [4] dissolved in nuclease-free water
   to a concentration of 50 µM, 1 µM, and 10 µM for P1, P2, and
   P3, respectively.
   P1: GAC CAT CTA GCG ACC TCC ACN NNN NNN N.
   P2: GAC CAT CTA GCG ACC TCC AC TTT TTT TTT TTT TTT TTT TT.
   P3: GAC CAT CTA GCG ACC TCC AC.
8. QIAquick PCR Purification Kit (Qiagen).


2.5 Detection of PCR Products

1. Agarose.
2. Distilled water.
3. Ethidium bromide.
4. 1x TAE buffer: 40 mM Tris, 20 mM acetic acid, 1 mM EDTA.


2.6 Illumina Nextera DNA Library Preparation

1. Nextera XT Library Prep Kit 96 samples (Box 1 of 2) (Illumina).
2. Nextera XT Library Prep Kit 96 samples (Box 2 of 2) (Illumina).
3. Nextera XT Index Kit 96 indexes-192 samples.
4. Agencourt AMPure XP beads (Beckman Coulter).
5. 96-well PCR plate (Scientific Inc.).
6. 96 Deep Well Block (Invitrogen).
7. Microseal "B" adhesive seals (BioRad).
8. Magnetic plate stand-96 (Life Technologies).
9. Ethanol, 200 proof (Sigma-Aldrich).


2.7 Next-Generation Sequencing

1. MiSeq v2 Reagent Kit 500 cycles PE-Box 1 of 2 (Illumina).
2. MiSeq v2 Reagent Kit Box 2 of 2 (Illumina).
3. MiSeq (Illumina).
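
The primer stocks in Sects. 2.3 and 2.4 are diluted to working concentrations, which follows the usual C1 x V1 = C2 x V2 relation. The short Python sketch below illustrates that arithmetic only; the function name and the 50 µl final volume in the example are assumptions chosen for illustration and are not part of the protocol.

```python
def dilution_volumes(stock_conc, working_conc, final_volume_ul):
    """Return (stock_ul, water_ul) needed to reach working_conc.

    stock_conc and working_conc must share one unit (e.g., pmol/ul or uM).
    Hypothetical helper based on C1 * V1 = C2 * V2.
    """
    if stock_conc <= 0 or working_conc <= 0 or final_volume_ul <= 0:
        raise ValueError("concentrations and volume must be positive")
    if working_conc > stock_conc:
        raise ValueError("working concentration cannot exceed the stock")
    stock_ul = working_conc * final_volume_ul / stock_conc
    return stock_ul, final_volume_ul - stock_ul


# Example: 100 pmol/ul stock to 20 pmol/ul working solution (Sect. 2.3),
# with an assumed final volume of 50 ul.
stock_ul, water_ul = dilution_volumes(100, 20, 50)
print(f"{stock_ul:.1f} ul stock + {water_ul:.1f} ul nuclease-free water")
```

The same call applies to the SISPA primers P1, P2, and P3 once a stock concentration is known.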


2.8 Sequence Assembly and Analysis

1. Kraken.
2. Krona.
3. BWA (Burrows-Wheeler Alignment Tool).
4. SAMtools.
5. Picard.
6. BLAST.
7. BioPython.
8. GATK (Genome Analysis Toolkit).
9. R.
10. IGV.


3 Methods


3.1 Viral RNA Extraction

1. Fecal or intestinal contents were diluted in HBSS to a final
   concentration of 20 % and homogenized with five stainless
   steel balls, followed by centrifugation at 2000 RCF at 4 °C
   for 5 min.
2. The supernatant was used for RNA extraction with the
   MagMAX Pathogen RNA/DNA Kit (Life Technologies)
   (see Note 1).


3.2 Real-Time RT-PCR Reaction

1. Real-time RT-PCR with a 25 µl reaction volume was completed
   using the QIAGEN one-step RT-PCR kit: 5 µl 5x RT-PCR
   buffer, 0.5 µl forward primer (10 pmol), 0.5 µl reverse primer
   (10 pmol), 0.5 µl probe (10 pmol), 1 µl dNTP, 1 µl enzyme
   mix, 0.2 µl RNasin inhibitor (40 U/µl, Promega), and 2.5 µl
   RNA template.
2. The amplification conditions were 50 °C for 30 min; 95 °C for
   15 min; and 45 cycles of 94 °C for 15 s and 60 °C for 45 s.


3.3 One-Step RT-PCR Reaction

1. RT-PCR with a 25 µl reaction volume was completed using the
   QIAGEN one-step RT-PCR kit: 5 µl 5x RT-PCR buffer, 0.8 µl
   forward primer (20 pmol), 0.8 µl reverse primer (20 pmol)
   (Table 1), 1 µl dNTP, 1 µl enzyme mix, 0.2 µl RNasin inhibitor
   (40 U/µl, Promega), and 2.5 µl RNA template.
2. The amplification conditions were 50 °C for 30 min; 95 °C for
   15 min; and 45 cycles of 94 °C for 30 s, 54 °C for 30 s, and 72 °C
   for 1 min 30 s.
3. Analyze the PCR products on a 1 % agarose gel run for
   1 h at 90 V.
4. Excise the bands of the correct size and perform gel purification
   with a Qiagen gel purification kit (see Note 4).
5. Quantify the DNA generated by a fluorescence-based method
   (Qubit 2.0 Fluorometer) and use a final DNA input of
   1 pg (see Note 5).


3.4 SISPA Method (See Note 6)


3.4.1 First-Strand Synthesis

1. Combine 1 µl of 50 µM random primer P1; 1 µl of 1 µM oligo dT
   primer P2; 1 µl 10 mM dNTP mix; and 10 pg-5 µg of RNA template.
   Add water up to 13 µl total volume.
2. Incubate the reaction at 65 °C for 5 min and then on ice
   for at least 1 min.
3. Add 4 µl 5x first-strand buffer; 1 µl 0.1 M DTT; 1 µl RNase
   inhibitor; and 1 µl of SuperScript III Reverse Transcriptase.
4. Incubate the reaction at 25 °C for 5 min, 50 °C for 30-60 min,
   and 70 °C for 15 min.
5. Add 1 µl RNase H (NEB) to the reaction.
6. Incubate at 37 °C for 20 min.


3.4.2 Klenow Amplification

1. Add 3 µl 10x Klenow reaction buffer; 1 µl of 25 pmol dNTP;
   and 1 µl of 1 µM random primer P1 to the reaction from
   Sect. 3.4.1.
2. Incubate at 95 °C for 2 min and cool to 4 °C.
3. Add 1 µl Klenow fragment (NEB).
4. Incubate at 37 °C for 60 min and 75 °C for 20 min.


3.4.3 PCR Amplification

1. Combine 5 µl 10x Advantage 2 PCR buffer; 1 µl 50x dNTP mix; 2 µl
   10 µM barcode primer P3; 1 µl 50x Advantage 2 Polymerase
   Mix; and the DNA template from Klenow amplification. Add water up
   to 50 µl total volume.
2. Incubate the reaction using the following PCR program: 1
   cycle: 95 °C 5 min; 5 cycles: 95 °C 1 min, 59 °C 1 min, 68 °C
   1 min 10 s; 33 cycles: 95 °C 20 s, 59 °C 20 s, 68 °C 1 min
   30 s; 1 cycle: 68 °C 10 min.
3. Use 5 µl to analyze the PCR products on a 1 % agarose gel
   run for 1 h at 90 V (see Note 7).
4. Use the QIAquick PCR Purification Kit to purify the remaining
   45 µl.


3.5 Library Preparation Using Nextera XT Kit

1. Quantify the DNA generated by a fluorescence-based method
   (Qubit 2.0 Fluorometer) and use a final DNA input of
   1 pg.
2. Perform the library preparation according to the Illumina
   manual, which includes tagmentation of genomic DNA,
   PCR amplification, PCR cleanup, library normalization, and
   final library pooling for MiSeq sequencing.
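
When many reactions are set up at once, the per-reaction volumes of the targeted amplification RT-PCR (Sect. 3.3, step 1) can be scaled into a master mix. The Python sketch below shows one way to do that arithmetic. It assumes that nuclease-free water makes up the remainder of the 25 µl reaction (the protocol does not list a water volume) and that a 10 % pipetting overage is wanted; both the overage and the function names are illustrative assumptions, not part of the protocol.

```python
# Per-reaction reagent volumes (ul) from Sect. 3.3, step 1 (25 ul reaction).
REAGENTS_UL = {
    "5x RT-PCR buffer": 5.0,
    "forward primer": 0.8,
    "reverse primer": 0.8,
    "dNTP": 1.0,
    "enzyme mix": 1.0,
    "RNasin inhibitor": 0.2,
}
TEMPLATE_UL = 2.5   # RNA template, added to each tube separately
REACTION_UL = 25.0  # final reaction volume


def water_per_reaction():
    """Assumption: nuclease-free water fills the reaction up to 25 ul."""
    return round(REACTION_UL - TEMPLATE_UL - sum(REAGENTS_UL.values()), 2)


def master_mix(n_reactions, overage=0.10):
    """Scale all components except the template for n_reactions.

    overage is a hypothetical pipetting allowance (10 % by default).
    """
    if n_reactions < 1:
        raise ValueError("n_reactions must be at least 1")
    factor = n_reactions * (1.0 + overage)
    mix = {name: round(vol * factor, 2) for name, vol in REAGENTS_UL.items()}
    mix["nuclease-free water"] = round(water_per_reaction() * factor, 2)
    return mix


if __name__ == "__main__":
    for name, vol in master_mix(8).items():
        print(f"{name}: {vol} ul")
```

Because each of the 19 fragments in Table 1 uses its own primer pair, the scaled primer volumes apply only to reactions that share one primer pair; otherwise the primers are added per tube and the remaining components are pooled.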


3.6 Sequence Assembly and Analysis

1. Kraken is used to initially identify raw reads, and Krona provides a
   graphical representation of the reads. A custom Kraken database is
   used. It was built from the standard database containing all RefSeq
   bacterial and viral genomes along with all complete swine enteric
   coronavirus disease (SECD) genomes available at NCBI and a pig genome.
2. Raw reads are run through an in-house custom shell script. In
   brief, 18 complete genomes from NCBI representing the four
   SECD virus species (TGEV, PRCV, PEDV, and PDCoV) are
   used as references to align raw reads. A function is looped three
   times. This function aligns reads and removes duplicates, creates a
   VCF, updates the reference with the VCF information, and performs a
   BLAST search against the nt database using the updated reference.
   The 18 complete genomes are used to start the first loop, and the
   top hit returned from each loop is used as the reference for the
   next loop. A total of three loops are performed to find the best
   reference.
3. After the best reference has been found, alignment metrics
   including read counts, mean depth of coverage, and percent of
   genome with coverage are collected.
4. Reports summarizing the alignment metrics (Fig. 1), along
   with the interactive Krona HTML file of the Kraken identification, a
   FASTA of the assembled genome, and a depth of coverage profile
   graph (Fig. 2), are e-mailed to concerned individuals.
5. The assembled FASTA file can be visually verified in IGV using
   the BAM and VCF output from the script. If necessary, the
   FASTA can be corrected in a program of choice.
6. Script details are provided on GitHub (https://github.com/USDA-VS/public/blob/master/secd/idvirus.sh).


    Sample: OH851-RP-Virus
    Reference_Set: SECD
    R1 file size: 451M, read count: 914188
    R2 file size: 451M, read count: 914188

    223,955 virus reads --> 24.4977% of total reads

    Alignment stats (reference guided):
    reference used                              read count  percent cov  ave depth
    Porcine_epidemic_diarrhea_virus-KJ399978    346,926     98.36%       2,764x
    Porcine_Resp_Corona_Virus-DQ811787          68,978      50.83%       713x

    reference                                   qlength  slength  % id   mis   evalue  bscore  Description
    Porcine_epidemic_diarrhea_virus-KJ399978    20025    28029    99.98                51729   Porcine epidemic diarrhea virus strain OH851, complete genome KJ399978.1
    Porcine_Resp_Corona_Virus-DQ811787          26617    27550    93.00  1654  0.0     38778   PRCV ISU-1, complete genome DQ811787.1

Fig. 1 Report summary for sample OH851-RP-Virus. The reference set used to initiate the shell script was
SECD (swine enteric coronavirus diseases). File size and read counts for each FASTQ file are shown. Kraken
identified 223,955 virus reads. "Reference used" is the closest match in the NCBI nt database. "Read count"
is the number of raw reads matching the reference. "Percent cov" is the percentage of the reference with
coverage. A coverage of 98.36 % for PEDV indicates a true find relative to the sporadic <51 % coverage seen
for PRCV, although in this case the presence of PRCV cannot be ruled out. No reads matched TGEV or PDCoV,
so these are not shown. Because of the high percentage of genome coverage, the completed reference-guided
assembly for PEDV was searched with BLAST against the nt database to provide mismatches, e-value, and bit
score against the most closely related publicly available genome
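
The iterative reference selection described in step 2 of Sect. 3.6 can be outlined in a few lines of Python. This is only a control-flow sketch under stated assumptions: the actual pipeline is the shell script linked in step 6, and the two callables passed in here (`align_and_update` and `top_blast_hit`) are hypothetical placeholders standing in for the alignment/VCF update and the BLAST search, respectively.

```python
from typing import Callable, List


def find_best_reference(
    initial_references: List[str],
    align_and_update: Callable[[List[str]], str],
    top_blast_hit: Callable[[str], str],
    n_loops: int = 3,
) -> str:
    """Refine the reference over n_loops rounds (three in the protocol).

    align_and_update: hypothetical stand-in for aligning reads, removing
        duplicates, calling variants, and returning the updated reference.
    top_blast_hit: hypothetical stand-in for a BLAST search against nt
        that returns the identifier of the top hit.
    """
    if not initial_references:
        raise ValueError("at least one starting reference is required")
    references = list(initial_references)
    for _ in range(n_loops):
        updated = align_and_update(references)   # align, dedupe, apply VCF
        references = [top_blast_hit(updated)]    # top hit seeds next loop
    return references[0]
```

The first round starts from the full reference set (18 genomes in the protocol); every later round starts from the single top hit of the previous round.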
Fig. 2 The depth of coverage profiles for sample OH851-RP-Virus. The x-axis is the genome position. The
y-axis is the log depth of coverage. Reads matching any of the four SECD target viral species are shown
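
The metrics reported in Fig. 1 and plotted in Fig. 2 can be derived from a list of per-base read depths, such as the third column of `samtools depth -a` output. The Python sketch below is an illustration under stated assumptions rather than the script's own code: it averages depth over all reference positions, counts a position as covered when its depth is above zero, and uses log10(depth + 1) for the profile. The log base and the +1 offset are assumptions, since the figure legend states only "log depth".

```python
import math
from typing import Dict, List, Sequence


def coverage_metrics(depths: Sequence[int]) -> Dict[str, float]:
    """Percent of reference covered and mean depth over all positions."""
    if not depths:
        raise ValueError("depths must not be empty")
    covered = sum(1 for d in depths if d > 0)
    return {
        "percent_cov": 100.0 * covered / len(depths),
        "ave_depth": sum(depths) / len(depths),
    }


def log_depth_profile(depths: Sequence[int]) -> List[float]:
    """log10(depth + 1), so uncovered positions plot at zero (assumption)."""
    return [math.log10(d + 1) for d in depths]


def percent_of_reads(matching_reads: int, total_reads: int) -> float:
    """Share of reads assigned to a category, as a percentage."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return 100.0 * matching_reads / total_reads


if __name__ == "__main__":
    toy_depths = [0, 10, 10, 20]  # toy data, not from sample OH851
    print(coverage_metrics(toy_depths))
    # Values from Fig. 1: 223,955 virus reads, 914,188 reads per file
    print(round(percent_of_reads(223955, 914188), 4))
```

With the values from Fig. 1, `percent_of_reads(223955, 914188)` reproduces the reported 24.4977 %, which suggests the percentage is taken relative to the read count of one FASTQ file.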


4 Notes

1. The MagMAX Pathogen RNA/DNA Kit was used for the
   extraction of nucleic acid for pathogen detection, including
   the detection of TGEV, PEDV, and PDCoV, from pig feces
   or intestinal contents.
2. The real-time RT-PCR assay was developed by our laboratory
   and the primers and probes target the M gene of the virus.
3. In our laboratory, the QIAGEN one-step RT-PCR kit has
   been used to amplify RT-PCR products between 100 bp and
   1.8 kb in length.
4. Alternatively, 5 µl out of 25 µl could be loaded on the gel to
   confirm that each target band is amplified, and then the remaining
   20 µl can be purified with the QIAquick PCR Purification Kit.
5. A smaller amount of input DNA than the required 1 pg for the
   targeted amplification method could be used to avoid
   overwhelming sequence reads.
6. The SISPA method is recommended when the Ct value of
   real-time RT-PCR is below 15.
7. When 5 µl was applied to the gel, a smear band can be observed.
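
The choice between the two amplification routes (Note 6 and Sect. 1) comes down to a single Ct threshold, which can be written as a small helper. This Python sketch is illustrative only: the function name is an assumption, and treating a Ct of exactly 15 as a targeted amplification case is also an assumption, since the text specifies only "below 15" for SISPA.

```python
SISPA_CT_CUTOFF = 15.0  # real-time RT-PCR Ct threshold from Note 6


def choose_method(ct_value: float) -> str:
    """Suggest an amplification method from the real-time RT-PCR Ct value."""
    if ct_value < 0:
        raise ValueError("Ct value cannot be negative")
    if ct_value < SISPA_CT_CUTOFF:
        return "SISPA (random priming)"
    return "targeted amplification"


if __name__ == "__main__":
    for ct in (12.3, 15.0, 27.8):  # example Ct values, not measured data
        print(ct, "->", choose_method(ct))
```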